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5 August 1963 


PROGRAMS AND PLANS FOR ACTION TO ACCOMPLISH 
UNITED STATES FOREIGN INTELLIGENCE OBJECTIVE NUMBER 3 

INFORMATION PROCESSING TECHNIQUES 

I a OBJECTIVE NO. 3 

That research he intensified to determine die usefulness of data processing 
techniques, including mechanised title or su m ma r y sentence permutation, to facilitate 
review assessment of the great volume of material that must be dealt with in the 
intelligence community. 

M. THE PROBLEM IN PERSPECTIVE 

f As of today, only man possesses the intelligence and insight to ask questions, 
to exercise Judgment, and to recognise a good solution when it turns up; Modern 
inf ormati on processing equipment and techniques , however , can be of substantial 
assistance to an intelligence analyst by drastically reducing the amount of time and 
effort required to search for the information he desires, to collate it with other 
pertinent information, to update it, to rearrange it in different ways to see if any 
patterns are developing, and to perform various typea of mathematical computations 
and logical structuring. The machine will never replace the intelligence analyst, but, 
if skillfully used, it can provide an extremely powerful tool of analysis. 

Research and development work in intelligence automation is currently at a high 
level and is expected to increase in the years immediately ahead. The rate cf progress, 
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will foe gonmd more by availability of lirat^nte r©E ; .»a tch wosfeti?s ib&a 

by fh® supply of funds. 

Om of the fundamental difficulties in applying automation t© iatolilfraoe prctol .cans 

is shat computer equipment la use today is foesically designed fear the processing of 

\ 

numerical date. Yet. most intelligence ABF efforts er® primarily mened with 
reoaipts proemhe, storage,, and retrieval of aoa^ntimertoal data. 

Morj-aumertcal information processing deals with the manipulation of ay^abelS;, 
meaafeags and decletaas in an essentially qualitative and descriptive manner. As 
opposed to the processing of numerical data ^ifeieh is characterised by it® fundamental 
wiianoe on arithmetic. she processing ©f n<m-*numerieal data Includes structuring, 
Worcfertog, indexing, searching, aortttkg or rearranging, saergteg, updating nod tfe« 

ilka. Present technique# for perform teg these tasks arc generally act formalised and not 
reducible to a consistent, logical notation, la short, the task ©if defining the problem 
so that it can be programmed on the computer is much more difficult in wm* numerical 


th.nn in amner&eal processing. 

HE, OPERATIONAL AND DEVELOPMENT PROGRAMS 

The intelligence community is presently organised so as to foe responsive to 
changes as a result of the current technological revolution in information handling. 


toteEfgease handling techniques is exemplified by the DOD. Here each of the three 


military departments sad the MSA fo&v® research and development organisations . BJA, 
. in turn, develops intelligence research and development requirements for the DOD 
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Bofife MSA aad DIA are responsible for ensuring that the projects of the military 
department® &r© operationally eomd and meet operational roquiromoats in the 
cryptologic and aon^oryptologie intelligence communities, respectively. Th® Director 
©f Defease Research and Engineering then coordinates and geidsa all of these efforts 
for the Secretary of Defense. Finally, DDR&E, MSA, DIA, State Dept., FBI, th® 
Military Departments and CIA have representatives ©s the Committee on Documentation 
CCODIB]) of the Patted State* Intelligence Board. COBIB, which is chaired by CIA, 
provides central coordination and leadership for intelligence data handing 
matters. Problems are brought up by members for review guidance 
irformatioa is disseminated about on-going activities. For example, 

w 

having determined the Mad for a survey of community information processing 
stabilities, CODIB organised fee Ststff for Community Monotfien Processing 
Study ((SCZP&ij to perform the survey with group members supplied by CODIB 
reprosontsttve agencies under CIA direction. Aim, COMB identified the problem of 
compromising emsnsifong from operating equipment and is resolving 

the problem on a eommuatty~wid© basis through development of standard operating 
specifications, design and production of equipment and operating usage agreements . 

During the past 11 years, as the Cold War has spread to additional areas and as 
communist security measures have tightened, th® u. S. Intelligence Community has 
been forced to rely toeroaslagly on mass ooUeetioe techniques-. This has resulted to 
enormous volumes of electronic, photographic, acoustic, and hard copy information 

^v«vhich have inundated our manual prooenstog resources. 
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Manual methods of proceaalng have proved to be incapable of exploiting thoroughly 
the volume and type of raw data collected in the following fields: EL1NT; targeting; 
photography; current missile* air* naval and ground forces activity; merchant ah typing 
activity; and biographic material. Because of this* in toe early 19S0°s efforts began 
in toe Intelligence Community to apply punch card techniques to oertain intelligence 
files* primarily in the targeting and document storage and retrieval areas. Each year 
since then* APD has been applied to a growing number of intelligence tasks. At present, 
operational and development programs are underway in toe following areas: 

A. Document Storage and Retrieval 

In 1958 the first automated document storage and retrieval system (MXN1CARDJ 
began to be implemented in Air Force Intelligence. Today a number of intelligence 
organisations possess machine supported document storage and retrieval systems 
tailored to their specific requirements. 

CIA operates a central Information control system in which about 275,000 
incoming items per year are indexed according to a comprehensive subject classification 
scheme developed by the Intelligence Community. The index record for each document 
is stored on punch cards and the output to the intelligence analyst in response to a 
search query is % bibliography of document titles prepared by partially automated means. 
The information reports themselves are recorded on microfilm and mounted in aperture 
cards from which reproductions are readily obtained in response to requests. 

The Air Force has conducted experiments at SAC, NORAD and toe Foreign 
^Technology Division (FTDJ in a very promising indexing system for intelligence 
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documents called "Key Wort in Context (KW1CJ Indexing." For many years, efforts 
have been underway to develop automa t ic indexing techniques based on key words. 

The problem has been to combine key words into meaningful key terms, since an 
effective indexing system is most frequently based on terms rather than on single 
words. One of the most promising approaches now appears to be the KWIC system by 
of which it is possible to retrieve a listing of all appearances of a given wort in 
intelligence documents, together with the words appearing immediately before and after 
thu t word in each appearance. Thus, by seeing the word in context one can more 
readily find the document he is searching for. Such a system will soon become opera- 
tional on a limited scale in the DiA Current Intelligence and Indications Center. 
w NSA has been conducting experiments in capturing titles of reports automatically 

and preparing therefrom permuted wort indexes. The first results of this effort have 
become available recently and appear to be very encouraging. fNSA's applications of 
EDP techniques are otherwise in areas not pertinent to this paper 4 

The Department of State is currently developing a pilot system in the area of 
Cuban affairs based on a computer and designed to produce permuted subject indexes 
for the analyst. At present, sentences describing the contents of approximately 5,000 
documents of an 88,000 document library on Cuba have been prepared and machine 
input is expected to begin shortly. 

In CIA a major systems study was initiated in 1962 to investigate the 
application of computers to the Agency's central document storage and retrieval 
system. Hie study is now completing an investigation into analyst needs and Is 
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expected to proceed with detailed system design, and implementation of initial segments 
over die next three years. The proposed system would oover all document sources of 
significance to the analyst, all categories of Information (persons, organizations, places, 
things, subjects}, and all geographic areas, with minimum handling during processing. 

It is expected to incorporate input-output devices to speed data transcription and file 
querying, large capacity memories for index data storage, and a large -capacity document 
storage system in micron-image form. 

Both the Army and the Air Force have sponsored research in automatic 
abstracting from intelligence documents. Under this concept, the ADP equipment 
automatically selects and prints significant sentences from a report. Attached as the 

SrOovering sheet of a report, the Auto Abstract provides a good summary of the full 
contents. 

B. Biographic Intelligence Support 

The WALNUT System, now at an advanced stage of development in CIA, is 
a large file on foreign personalities of oounterintelligence interest. Automated document 
storage and retrieval equipment has been installed, a sophisticated name *e arching 
technique has been designed, and development of very large random access computer 
memory equipment to store the entire biographic index is at an advanced stage. A 
proposal is under study to develop a name traoing communications network that would 
provide quick access to information on foreign personalities stored anywhere in the 
Intelligence Community. 

W 
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C. Military intelligence Support 

Targeting, aa we know it today, could not exist without computers to store, 
collate, and retrieve the detailed data on 110,000 targets world-wide. Computers have 
also proved their worth in assisting in the production of radar order-of-battle from 
ELINT; fat producing air, naval and ground forces order-of -battle; in following current 
air and merchant shipping activity; in compiling damage assessment information; in 
intelligence support to war gaming; and in performing various mathematical computations. 
Efforts are currently underway to extend ADP to such functional areas as Current 
Intelligence and Indications, Collection Assets Inventories, Ports and Harbors, Coasts 
and Landing Beaches, Foreign Railroad and Highway Transportation Systems, and 

^ Photographic Storage and Betfreval. 

j 

In toe late 1950 s toe Army began to develop a computer system for collating 

inooming fragmentary reports on foreign ground forces order-of-battle. Under this 

j 

system, detailed items of information on foreign military units, e.g. , names of unit 
commanders, unit designations, locations, will be organized and collated to establish 
Ml identification and movement of military units. In 1963, this system was transferred 
to ESA and is currently undergoing testing. 

D. Overhead Reconnaissance Support 

Computer support to overhead reconnaissance has increased significantly 
during the past two years in CIA and ESA. Computers are being used in the following 
areas: (1) determining what to collect; (2J) technical support of operations; (3) Information 
y, and technical support of photographic interpretation. The computer-supported system 
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huKnin* field. It brings pertinent date oa each target (what's known and wbat'a needed! 
to til. elbow of the photo Interpreter M he «am tee. new photography. infomiMta. 

g<Ue8d tm read0Bt '* Promptly incorporated In the record need to ..>.h» T a w 
target prlorlUea and to siqiport now • todies of the given target. 

Both the Navy and the Air Fore, have under development eeml-autonuted 

teotlort photo interpretation systems. Thee, are designed to provide the photo 

Interpreter at a console with the necaaary Terence material without the attendant 

talk that 1. usually aseociated with reference material,, the console, also perform 

certain standard mathematical computations for the photo interpreter. On.ofth.se 

sretem. 1 . currently Installed and undergoing testing on two of the Navy., aircraft 
^barriers „ 

E» Machine Translation 

All three Military Services snd CIA have been sponsoring research In 
machine translation of foreign language documents. The present level of Government 
support to machine translation la about $3 million annually. Both general and .pedal 
purpose equipment is being used for this purpose. 

Three current developments are especially significant. First, CIA plana 
to contract for equipment which should both augment trunslatloo capabilities mid yield 
valuable date on an operating machine translation ay stem, second, the Air Force 
program ha. reached the .tag. where the special equipment involved will be transferred 
from the contractor’s facility to an operational environment at FTD In Dayton, Ohio, 
bin equipment will soon permit meehlne-aeeleted translation of 190.000 word, of 
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material per day. Third, a Joint Advisory Group on Automatic Language Processing 
has bean formed, with membership from the National Science Foundation, Department 
of Defense elements, and CIA. This groqp has agreed to support jointly a central staff 
to coordinate Government -supported machine translation research, evaluate its 
results, and guide its course in order to accelerate progress In this field. 

F. Intelligence Support of Command and Control Systems 

A considerable amount of research and development work la currently 
underway to provide intelligence in machinable form to the various command and 
control systems. The latter include the entire range of facilities from the National 
Military Command Center and its Alternates, which will serve National Command 
W Authorities, all the way down to such taotical systems as the Naval Tactical Data 
System (NTDSJi in individual ships. Tim Intelligence provided must be tailored to 
the decision-making process of the command served over a wide spectrum of 
emergency conditions . 

IVo CRITICAL AREAS IN FUTURE RESEARCH 


Three broad areas of research of critical importance to Intelligence are being 
considered: <A) conversion of hard copy material Into machinable form; (Bj) development 
of massive random access memories; and CQ development of concepts and systems 
for son-numerical intelligence data structuring and processing. Each of these is 
discussed briefly below: 


A. Conversion of 


form. At present, the bulk 


of die Intelligence material processed chi ADP equipment is first formatted manually 


from free text, then key punched onto punch cards, verified, converted to magnetic tape 
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mad finally entered into the computer., Thin it a costly procedure In terms of money, 
personas! and time. 

For all intelligence ADJP applications which entail huge data bases of 
historical material or which are characterized by voluminous daily inputs, the cost 
of data conversion is so great as to be a major problem. This is especially true of the 
processing of data ©a foreign missile* air* ground, naval* and merchant shipping 
activity. It is also true of projects pertaining to foreign transportation systems* porta, 
and harbors* coasts and landing beaches and biographic information. It is apparent that 
ADF will never be applied to the bulk c$ the historical or current intelligence material 
until the data conversion problem Is solved. The solution may lie, in part, is 
Research efforts currently underway to develop m optical scanner which can recognize 
lower and upper case letters and digits, and automatically punch the text on cards or 
paper tapes. As for the formatting problem, the solution may lie in programming 
efforts designed to permit the processing In computers of material which is virtually 
in Ml text. 

raj) Developm ent of Massive Random Access Memories. Because of the limited 

U if ri it i — i m ii i I T wpy-M w g -p ggM ff gMgrag wac^owaw— *B<Br*zaMmi**t*urv*mmrtarT"x*tT7?xar*^t.iM*aatr:x*n*>x»fvAac3imMua&z?2BK* 

internal memory capacity of computers, data bases today are stored on punched cards, 
magnetite tape, disc files or drums. Hone of these are entirely satisfactory. Storage 
capacity on cards or magnetic tape is theoretically unlimited, but processing speeds 
are relatively slow. Processing speeds on disc files or drums are much faster but 
storage capacity is limited. 

W g~E"C„R~E~T 
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Due to Hie enormous volumes of material encountered In Intelligence 
processing, there is a strong requirement for massive random access memories. 

The largest typical internal random access computer memory today contains about 
300* 000 words. Disc memories containing three million words are just becoming 
commercially available and even larger ones - up to 15 million words - are in the 
development stage. The DOD is currently sponsoring several projects to develop 
increased capacity memories. 

In order to attain a real breakthrough In this area and achieve, say a multi- 
billion bit magnetic memory* great strides in batch fabrication techniques for both 
the magnetic elements and the semi-conductor circuitry are necessary. Superconductive 
thin film technology offers the possibility of a very large capacity memory because 
of its ideal storage and switching characteristics combinsd with its integrated 
miniature batch fabrication possibilities. 

Three of the most promising eon-numerical techniques being pursued at 
present are adaptive procedures, self -organizing procedures, and content- 
addressing procedures. All of these are dependent in varying degree on the 
development of what has came to be known as "associative memory te chniq ues . 11 

Under this concept, the computer selects information on the basis of content, 
rather than on the basis of location of information or of indexing. All of memory 
can thereby be interrogated at one time. Such content addressable, or associative, 
Nfhemories could make searching for information very simple and could either simplify 
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or make unnecessary such tasks as ordering, merging, sorting, and collating of 
information which require so much time in today's serial processing. 

Very little has been written either on associative techniques or on the 
organisation cl a data processor which includes an associative memory. At the 
present time a bibliography of such papers includes 1ms than 20 authors. In torn, very 
little of what has been written has been directly funded by the Government. Most has 
been company-* or university-sponsored. Current efforts are highly scattered and are 
not focused on any definable goal, to particular, there are as yet no theories of machine 
organization or of information processing adequate to guide hardware design efforts. 

As a result, associative memories are being designed and built with minimum 
^reliance on logical design concepts. Within BOD no projects aimed at development of 
data processing organization concepts, of techniques, or of applications for associative 
memories have been funded (although there are several hardware programs for 
construction of small associative memories]). Approximately $1 million is available 
in FY-64 for research in this area, however. 

V RECOMMENDATIONS 

A. Shortage of Research Personnel 

Intensification of basic and applied research in ABP techniques of intelligence 
interest will be dependent primarily on increasing the number of first-rate research 
workers in universities, laboratories, and private research institutes who are engaged 
in tasks such as those mentioned in Section IV above. The USXB should express to the. 
National Science Foundation Its concern over the acute shortage of skilled research 
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with the Foundation to determine what remedial action* mightb e taken. 

B. Interagency Mechanising for Cooperative Effort 

Present-day problems In handling information are not unique to die Intelligence 
Community . Major investigations are underway in many quarters, particularly in die 
scientific community. B is likely that vigorous action by the Federal Council on 
Science and Technology's Committee on Scientific Information could and should contribute 
in areas of importance to intelligence. The P8IB should maintain adequate del wife. 
inter-agency mechanisms engaged in research in inform ation handling techniques 
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